Chromatic Correlation Clustering

Authors

  • FRANCESCO BONCHI
  • ARISTIDES GIONIS
  • FRANCESCO GULLO
  • CHARALAMPOS E. TSOURAKAKIS
Abstract

We study a novel clustering problem in which the pairwise relations between objects are categorical. This problem can be viewed as clustering the vertices of a graph whose edges are of different types (colors). We introduce an objective function that ensures the edges within each cluster have, as much as possible, the same color. We show that the problem is NP-hard and propose a randomized algorithm with approximation guarantee proportional to the maximum degree of the input graph. The algorithm iteratively picks a random edge as a pivot, builds a cluster around it, and removes the cluster from the graph. Although fast, easy to implement, and parameter-free, this algorithm tends to produce a relatively large number of clusters. To overcome this issue, we introduce a variant algorithm, which modifies how the pivot is chosen and how the cluster is built around the pivot. Finally, to address the case where a fixed number of output clusters is required, we devise a third algorithm that directly optimizes the objective function based on the alternating-minimization paradigm. We also extend our objective function to handle cases where objects' relations are described by multiple labels. We modify our randomized approximation algorithm to optimize such an extended objective function and show that its approximation guarantee remains proportional to the maximum degree of the graph. We test our algorithms on synthetic and real data from the domains of social media, protein-interaction networks, and bibliometrics. Results reveal that our algorithms outperform a baseline algorithm both in the task of reconstructing a ground-truth clustering and in terms of objective-function value.
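The pivot loop described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how a cluster is built around the pivot or how the objective is defined, so two details here are assumptions made for illustration only: a vertex joins the cluster when it is linked to both pivot endpoints by edges of the pivot's color, and the cost counts intra-cluster pairs that lack an edge of the cluster's color plus all edges running between clusters. The function names `pivot_edge_clustering` and `chromatic_cost` are hypothetical, and vertices are assumed to be mutually orderable (for example, integers or strings).

```python
import random
from itertools import combinations


def pivot_edge_clustering(vertices, colored_edges, seed=None):
    """Cluster an edge-colored graph by repeatedly picking a random pivot edge.

    colored_edges maps frozenset({u, v}) -> color.
    Returns a list of (vertex_set, cluster_color) pairs.
    """
    rng = random.Random(seed)
    remaining = set(vertices)
    edges = dict(colored_edges)
    clusters = []

    while edges:
        # Pick a pivot edge uniformly at random among the surviving edges.
        pivot = rng.choice(sorted(edges, key=sorted))
        u, v = sorted(pivot)
        color = edges[pivot]

        # ASSUMED rule: add every vertex joined to BOTH pivot endpoints
        # by an edge of the pivot's color.
        cluster = {u, v}
        for w in remaining - {u, v}:
            if (edges.get(frozenset((u, w))) == color
                    and edges.get(frozenset((v, w))) == color):
                cluster.add(w)

        clusters.append((cluster, color))
        remaining -= cluster
        # Remove the cluster from the graph, with all incident edges.
        edges = {e: c for e, c in edges.items() if not (e & cluster)}

    # Vertices left without edges become uncolored singleton clusters.
    clusters.extend(({w}, None) for w in sorted(remaining))
    return clusters


def chromatic_cost(clusters, colored_edges):
    """ASSUMED objective: number of vertex pairs that disagree with the clustering."""
    label = {}
    for idx, (members, color) in enumerate(clusters):
        for w in members:
            label[w] = (idx, color)

    cost = 0
    for x, y in combinations(sorted(label), 2):
        edge_color = colored_edges.get(frozenset((x, y)))
        (cx, cluster_color), (cy, _) = label[x], label[y]
        if cx == cy:
            # Same cluster: pay unless the edge exists with the cluster color.
            cost += int(edge_color != cluster_color)
        else:
            # Different clusters: pay for every edge that exists.
            cost += int(edge_color is not None)
    return cost
```

Calling `pivot_edge_clustering(vertices, colored_edges, seed=0)` and passing the result to `chromatic_cost` gives one randomized clustering and its cost under the assumed objective; different seeds may select different pivots and therefore produce different partitions.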

Similar resources

Chromatic Clustering in High Dimensional Space

In this paper, we study a new type of clustering problem, called Chromatic Clustering, in high dimensional space. Chromatic clustering seeks to partition a set of colored points into groups (or clusters) so that no group contains points with the same color and a certain objective function is optimized. In this paper, we consider two variants of the problem, chromatic k-means clustering (denoted...


Enrollment generators, clustering and chromatic numbers

We study the timetable conflict graphs produced by an artificial generator of student enrollments. We find correlations of their chromatic number with their density and clustering coefficient. The work gives evidence that the clustering coefficient is a useful measure of a graph.


Extraction of Scene Text in HSI Color Space using K-means Clustering with Chromatic and Intensity Distance

Text extraction is an important step that strongly influences the final recognition performance. This task is especially challenging in the case of scene text, which is characterized by a wide set of degradations such as complex backgrounds, uneven illumination, viewing angle, etc. In this paper we evaluated text extraction based on K-means clustering in HSI color space with chromatic distance and inte...


Chromatic Harmonic Indices and Chromatic Harmonic Polynomials of Certain Graphs

This paper introduces the concept of chromatic harmonic polynomials, denoted $H^{\chi}(G,x)$, and chromatic harmonic indices, denoted $H^{\chi}(G)$, of a graph $G$. The new concept is then applied to finding explicit formulas for the minimum (maximum) chromatic harmonic polynomials and the minimum (maximum) chromatic harmonic index of certain graphs. It is also applied to split graphs and ce...


Spatio-Chromatic Adaptation via Higher-Order Canonical Correlation Analysis of Natural Images

Independent component and canonical correlation analysis are two general-purpose statistical methods with wide applicability. In neuroscience, independent component analysis of chromatic natural images explains the spatio-chromatic structure of primary cortical receptive fields in terms of properties of the visual environment. Canonical correlation analysis explains similarly chromatic adaptati...


Feature Analysis of Chromatic or Achromatic Components based on Tensor Voting and Text Segmentation using Separated Clustering Algorithm

This paper presents a new technique for segmenting corrupted text images on the basis of color feature analysis by second-order tensors. It is shown how feature analysis can benefit from using second-order tensors with chromatic and achromatic components. The proposed technique is applied to text images corrupted by various types of noise. Firstly, we decompose an image i...


Publication date: 2015